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Abstract 

The optimal mass transportation was introduced by Monge some 200 years ago and 
is, today, the source of large number of results in analysis, geometry and convexity. 
Here I investigate a new, surprising link between optimal transformations obtained by 
different Lagrangian actions on Riemannian manifolds. As a special case, for any pair of 
non-negative measures A + , A~ of equal mass 

W 1 (\-,X + ) = lime^inf W P U + s\~ , + s\ + ) 

e— >0 M 

where W pi p > 1 is the Wasserstein distance and the infimum is over the set of probability 
measures in the ambient space. 

1 Introduction 

The Wasserstein metric W p (oo > p > 1) is a useful distance on the set of positive Borel 
measures on metric spaces. Given a metric space (M, D) and a pair of positive Borel measures 
A± on M satisfying f M dX + = f M dX : 



H^(A + ,A~) :=inf 



M JM 



D p (x,y)dir(x,y) 
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where V{X + , A~) stands for the set of all positive Borel measures on MxM whose M— marginals 
are A + , A - . 

Under fairly general conditions (e.g if M is compact), a minimizer ir° S V(X + , X~) of 
(jl.ip exists. Such minimizers are called optimal plans. I'll assume in this paper that M is 
a compact Riemannian manifold and D is a metric related (but not necessarily identical) to 
the geodesic distance. 

If in addition A + satisfies certain regularity conditions, the optimal measure ir° is sup- 
ported on a graph of a Borel mapping ^ : M — > M. By some abuse of notation we call a 
Borel map \& an optimal plan if it is a minimizer of 



w p (\ + ,\-) 



L> p (x,$(x))ciA H 



; * # A+ 



A" 



(see Section [1.2l Til for notation). 

The metric W p , p > 1 is a metrization of the weak topology C*(M) on positive Borel 
measures. In particular, it is continuous in the weak topology. Thus, it is possible to approx- 
imate W P (X + , A~) (and the corresponding optimal plan) by W P (X^, A^) on the set of atomic 
measures 



X% e M + ' N -.= | n = m i 6 (xi) ,™i>0, xi e M | 



N 



oo 



(1.2) 
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reducing into a finite- dimensional linear programming on the set of non-negative N x N 
matrices {Vij} subjected to linear constraints. 

There is, however, a sharp distinction between the case p > 1 and p = 1. If p > 1 then 
the optimal plan ir° is unique (for regular A + ). This is, in general, not the case for p = 1. 
Another distinctive feature of the case p = 1 is its "pinning property": The distance W\ 
depends only on the difference A := A + — A~. This is manifested by the alternative, dual 
formulation of W%: 

^i(A) =sup|y^A ; WLip<l| (1.3) 
where \\4>\\ L ip ■= sup x ^ yeM {4>{x) - 4>{y)) /D(x,y). 

The optimal potential yields some partial information on the optimal plan \& (if exists). 
In particular, X7(J)(x), whenever exists, only indicants the direction of the optimal plan. For 
example, if the metric D is Euclidian, then *f>(x) = x + t(x)V</>(x) for some unknown t(x) £ 
M + . This is in contrast to the case p > 1 where a dual variational formulation, analogous to 
(|1.3p , yields the complete information on the optimal plan \& in terms of the gradient of some 
potential <j). 

In this paper I consider an object called the p— Wasserstein distance (p > 1) of A + to A - , 
conditioned on a probability measure \i: 

W^\\\ii) :=sup|y^A ; J \V(p\ q dfi < 1^ (1.4) 

where q = p/(p — 1). 
The first result is 

W 1 {X)=mmlw^(X\\fi) ; J dfi = lj , (p > 1) (1.5) 

The problem associated with (ll.5p is related to shape optimization, see j7]- In addition, the 
minimizer /i in (|1.5p and the corresponding maximizer in (|1.4p or (jl.3p play an important 
rule in the Li theory of transport [12] . In fact, the optimal <j) is, in general, a Lipschitz 
function which is differentiable \x a.e. and satisfies |V0| = 1 /J. a.e. The minimal measure /U is 
called a transport measure. It verifies the weak form of the continuity equation which, under 
the current notation, takes the form 

V • (/xV0) 



Wt(X) 



The transport measure yields an additional information on the optimal plan ^ along the 
transport rays which completes the information included in [E]. In the context of shape 
optimization it is related to the optimal distribution of conducting material [7] . See also [19] , 
[23], [24]. 

The evaluation of the transport measure /i is therefore an important object of study. It is 
tempting to approximate the transport measure as a minimizer of (jl.5p on a restricted finite 
space, e.g. for fj, £ M +,N as defined in (jl.2|) . 
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However, this cannot be done. Unlike W p , W^ p \X\\fi) is not continuous in the weak 
topology of C* on Borel measures with respect to both fi and A. Indeed, it follows easily that 
W\ (A||/i) = oo for any atomic measure ji. 

The second result of this paper is 

W^\XU) = lim nW p (u + \ + /n, + \~ /n) (1.6) 

Here the limit is in the sense of V convergence. A somewhat stronger result is obtained if we 
take the infimum over all probability measures //: 

WAX) = lim nmmW p (n + X + /n,^i + A~/n) (1.7) 

where the convergence is, this time, pointwise in A. 

The importance of ((LSI E3) is that W^(X\\/i) can now be approximated by a weakly 
continuous function 

Wi p) (X + ,X-\\fi) :=nW p (fx + X + /n,ti + X~ /n) . 

Suppose no is a unique minimizer of (|1.5p . If fi n is a minimizer of wi p) (X + ,X~\\fj,) then the 
sequence {fJ. n } must converge to the transport measure (J,q. In contrast to W^ p \ Wn is 
continuous in the C* topology with respect to fi. Hence [i n can be approximated by atomic 
measures ^ G M +,N (|1.2p . In particular the transport measure can be approximated by a 
finite points allocation obtained by minimizing Wn P ^ on M +,N for a sufficiently large n and 
N, 

The results (11.51 - 1 1 . 7f) can be extended to the case where the cost D p on M x M is gener- 
alized into an action function on a Riemannian manifold M x M, induced by a Lagrangian 
function I : TM — > R. This point of view reveals some relations with the Weak KAM Theory 
dealing with invariant measures of Lagrangian flows on manifolds. 

1.1 Overview 

Section [2] review the necessary background for the Weak KAM and its relation to optimal 
transport. Section [3] state the main results (Theorems [IH]) , which correspond to (|1.51 11.7]) 
for homogeneous Lagrangian on M x M. Section H] presents the proof of the first of the 
main results which generalizes (II. 4h . Finally, Section [5] contains the proofs of the other main 
results which generalize (|1.6l II. 7[) . 

1.2 Standing notations and assumptions 

1. (M,g) is a compact, Riemannian Manifold and D : M x M — > M + is the geodesic 
distance. 

2. TM (res. T*M) the tangent (res. cotangent) bundle of M. The duality between 
v G T X M and p G T*M is denoted by (£, v) G R. The projection n : TM -> M is the 
trivialization n(x,w) = x. Likewise n* : T*M — >• M is the trivialization n*(x,^) = x. 
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3. For any topological space D, A4(D) is the set of Borel measures on D, A4o(D) C M(D) 
the set of such measures which are perpendicular to the constants. M. + (D) C Ai(D) 
the set of all non-negative measures in M, and Mf(D) C M. + (D) the set of normalized 
(probability) measures. If D = M, the parameter D is usually omitted. 

4. A Borel map <J> : D\ — > D2 induces a mapping : M. + (Di) — > A4 + (-D 2 ) via 

^ # {^)(A) = ^- l (A)) 

for any Borel set A C Z?2- 

5. For any 2, y G M let K% y be the set of all absolutely continuous paths z : [0, T] — » M 
connecting x to y, that is, z(0) = cc, z(T) = y. 

6. Given /xi,/X2 £ A4 + , the set V{ni-, H2) is defined as all the measures A G ,M + (M x M) 
such that tti #A = /ii and vr 2j #A = ^2, where 7T, : A4 x — » A4 defined by 7Ti(x, y) = x, 
7r 2 (x,y) = y. 

7. An hamiltonian function h G C 2 (T*M;]R) is assumed to be strictly convex and super- 
linear in £ on the fibers T*M, uniformly in x G M, that is 

h(x,£)>-C + h(€) where lim = °° . 

ll£IH°o 

The Lagrangian I : TM — > R is obtained by Legendre duality 

Z(a:,u) = sup - hfof) 

satisfies Z G C 2 (TM;R), and is super linear on the fibers of T X M uniformly in x. 

8. Expm : TM x R — » M is the flow due to the Lagrangian / on M, corresponding to the 
Euler-Lagrange equation 

±1 -I 
at 

For each i G R, Exp^ : TM — ► M is the exponential map at time i. 

2 Background 

The weak version of Mather's theory [20] deals with minimal invariant measures of La- 
grangians, and the corresponding Hamiltonians defined on a manifold M. In this theory 
the concept of an orbit z = z(t) : R — > M is replaced by that of a closed probability measure 
on TM: 



A4g := jz/ G A4f (TM) ; ^ (d<f>,v)dv = for any G C 1 (M)| . 
A minimal (or Mather) measure um G A4§ is a minimizer of 



f2.ll 



inf / l(x,v)du(x,v) := -E (2.2) 
"6^5 Jtm 
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It can be shown ([2], [IS], [3J) that any maximizer of (|2.2|) is invariant under the flow induced 
by the Euler-Lagrange equation on TM: 

—Vil(x,x) = VJ(x,x). (2.3) 

There is also a dual formulation of (12, 2j) [17J, [29J: 

sup inf / h(x,d(j))dfi = E , (2.4) 

where the maximizer /ij^f is the projection of a Mather measure vm on M. The ground energy 
level E, common to (|2.21 12. 4[) . admits several equivalent definitions. Evans and Gomes ( |llj 
|13j |14j ) defined ,5 as the effective hamiltonian value 

E:= inf sup h(x, d(f>) , 

<P£CHM) X&M 

while the PDE approach to the WKAM theory ([IS]) [E]) defines E as the minimal E G K 
for which the Hamilton- Jacobi equation ci^>) = S admits a viscosity sub-solution on M. 
Alternatively E is the on/?/ constant for which h(x,d(f>) = E admits a viscosity solution [15] . 
There are other, equivalent definitions of E known in the literature. We shall meet some of 
them below. 

Example 2.1. i) I = Ik '■= \v \ p /(p — 1) where p > 1. Here E = and hm is the volume 
induced my the metric g. 

ii) l(x,v) = (l/2)\v\ 2 — V(x) where V E C 2 (M) (mechanical Lagrangian) . Then E = 
max xg M V(x) and /jlm of \2.1$ is supported at the points of maxima ofV. 

Hi) l(x,v) = Ik(v — W(x)) where W is a section in TM. 

Then 12. implies E < 0. In fact, it can be shown that E = for any choice of W. 

iv) In general, if P is in the first cohomology of M (H}(M)) then I i— > l(x,v) — (P,v) 
induced the hamiltonian h i— >• h(x,£ + P) and E = a{P) corresponds to the celebrated 
Mather (a) function [20] on the cohomology H 1 (M). See also \21^ . 

The Monge problem of mass transportation, on the other hand, has a much longer history. 
Some years before the the French revolution, Monge (1781) proposed to consider the minimal 
cost of transporting a given mass distribution to another, where the cost of transporting a unit 
of mass from point x to y is prescribed by a function C(x, y). In modern language, the Monge 
problem on a manifold M is described as follows: Given a pair of Borel probability measures 
Ho, Hi on M, consider the set )C(ho,Hi) °f an Borel mappings $ : M — > M transporting Ho 
to Hii i- e 

$ G K(ho,Hi) $#Ho = Hi 
and look for the one which minimize the transportation cost 

C(j4),pi):=wi(J C(x,$(x))d/Mi(x) ; $ € £(a*o,Mi)J ■ ( 2 - 5 ) 
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In this generality, the set K(/J<o, /Ui) can be empty if, e.g., contains an atomic measure, so 
C(no,fj,i) = oo in that case. In 1942, Kantorovich proposed a relaxation of this deterministic 
definition of the Monge cost. Instead of the (very nonlinear) set /C(/Xo,//i), he suggested to 
consider the set V(no, defined in section fl~2l -(j6"|). Then, the definition of the Monge metric 
is relaxed into the linear optimization 

C(/xo,A*i) = mhJ / C(x,y)dA(x,y) ; A G V(po, m) \ ■ (2.6) 
A I J MxM ) 

Example 2.2. The Wasserstein distance W p (p > 1) is obtained by the power p of the metric 
D induced by the Riemannian structure: 



W p (no,Hi) = mm 



D p (x,y)dA{x,y) 

MxM 



VP ) 

■AGP( W)/1 i) (2.7) 



The advantage of this relaxed definition is that C(/io,/^i) is always finite, and that a 
minimizer of (|2.6p always exists by the compactness of the set V(iiq, /ii) in the weak topology 
C*(M x M). If fiQ contains no atomic points then it can be shown that C(no, Hi) 1 s given by 
(123]) and (gSJ coincide [J. 

The theory of Monge-Kantorovich (M-K) was developed in the last few decades in a 
countless number of publications. For updated reference see |12j . [28]. 

Returning now to WKAM, it was observed by Bernard and Buffoni ([lj[5]- see also [29 ) 
that the minimal measure and the ground energy can be expressed in terms of the M-K 
problem subjected to the cost function induced by the Lagrangian (recall section [L2l l5jl 

C T {x,y) :=inf j J I (z(s); z(s)) ds , z € KgA ,T > . (2.8) 
C T (ji) := C T (fj,,(j,) = mini / C T (x,y)dA(x,y) ; Ae%/i) 

A iJMxM ) 

and 

min {Ct(jj) ; H G Mf} = -TE (2.9) 



Then 



where the minimizers of (|2.9p coincide, for any T > 0, with the projected Mather measure 
HM maximizing (12. 4p [5]. 

The action Ct induces a metric on the manifold M : 

(x,y) £ AfxMn D E (x,y) = MQr(x,y)+TE . (2.10) 



1 By convention, the name "Monge problem" is reserved for the metric cost, while "Monge-Kantorovich 
problem" is usually referred to general cost functions 
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Example 2.3. 

i) For l(x, v) = \v\ p / (p — l),p> 1 we get Ct(x, y) = D(x, y) v I (p — 1)T P_1 while D E (x, y) = 

pE 1 ~ 1 / p D g (x,y)/(p — 1) ifE>0, D E (x,y) = -oo if E < 0. 

ii) l(x,v) = (l/2)|f| 2 — V(x) where V G C 2 (M) (mechanical Lagrangian) . Then D E (x,y) 

is the geodesic distance induced by conformal equivalent metric (M, (E — V)g) on M , 
where E > E_ = sup A / V. 

It is not difficult to see that either D E (x,x) = for any x G M, or DE(x,y) = — oo for 
any x, y G M. In fact, it follows ([22], [313]) that D E (x, y) = — oo for E < E and D E (x, x) = 
for E > E and any x,y £ M. 

Let now A + , A~ G A1 + where A := A + — A~ G .Mo, that is J M dX = 0. Let 

V E {\) :=V E (\+,\-) =min{ / D E (x,y)dA(x,y) ; A G 7-*(A)l (2.11) 

be the Monge distance of A + and A~ with respect to the metric D E . There is a dual formu- 
lation of D E as follows: Consider the set C E of D E Lipschitz functions on M: 

C E :={cj> G C(M) ; 0(x) - <j>{y) < D E (x, y) V x, y G M} (2.12) 

Then (see, e.g [12]. [25]) 

V E (\)=max!^J 0dA ; ^ € £js.| (2.13) 

3 Main results 

The object of this paper is to establish some relations between the action Ct and a modified 
action Ct defined below. 

3.1 Unconditional action 

For given A G Aio we generalize (I2.ip into 

M% := \u G Mf(TM) ; / (d<f>,v)di> = f 4>d\ for any G C 1 (M) 1 (3.1) 
and define 

C(X):=m\ f l(x,v)du(x,v) ; i/eA^j. (3.2) 
" Utm J 

The modified action Ct : A4q — » M U {oo}, T > have several equivalent definitions as 
given in Theorem [1] below: 

Theorem 1. T/ie following definitions are equivalent: 
1. C T (X) := TC (A) . 
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2. C T {\) := min^sup^ {f M -Th(x,d(f>)dfi + (pdX ; y, G Mf , cj)eC 1 (M)} . 

3. C T {\) := maxBXE [2^ (A) - ET] . 

In addition if T c := Dg(\) < oo then for T > T c , 

C T {\)=C Tc {X)-TE. 
In that case the minimizer G Mfof (T^j). T > T c is given by 

Ma = y^a c + ( 1_ 7p I MM , 
where fiM is the projected Mather measure. 

Remark 3.1. Note that T>e(X) \2.11[ [2~TB]) is a monotone non- decreasing and concave func- 
tion of E while De(X) > — oo by definition. Hence the right- derivative ofD^(X) as a function 
of E is defined and positive (possibly +oo at E = E). 

Remark 3.2. A special case of Theorem^ was introduced in ]30$ . 

For the next result we need a technical assumption H, introduced above Lemma 15.51 

Theorem 2. Assume H. For any A = A + — A~ where A^ £ ■M-x , 

Ct(A) = lim min e _1 C e r(M + eA~, \x + eA + ) . 

Remark 3.3. H holds, in particular, for any mechanical hamiltonian and for an homoge- 
neous space M, e.g the fiat d-torus R d /Z n or the sphere S^ 1 = SO(d)/SO(l). 

As an application of Theorem [2] we may consider the case where the lagrangian I is 
homogeneous with respect to a Riemannian metric gr x y. 



Example 3.1. Ifl(x,v) = \v \ p /(p — 1) where p > 1. then Ct(x,h) = (j^iyjy-i while 
D E (x,y) = -^ l E^- 1 ^D{x,y) andE = 0. It follows that 

where the Wasserstein distance W p is defined in \2. 7|j. Hence, by Theorem^ and Theorem^ 



Wi(A) = lime _1 inf W p (/x + e\~ , \i + eA + ) . 

Remark 3.4. T/te optimal transport description of the weak KAM theory \2. 9\) can be con- 
sidered as a special case of Theorem^ where A = 0. Indeed inf Lt&A/( + e~ 1 C £ T(y, y) = —TE_ 
by $2.9\) . On the other hand, since 2?e(0) = for any E > E it follows that T c = 0, hence 
Ct c (0) = so Ct(0) = —TE_ as well by the last part of Theorem^ 
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3.2 Conditional action 

There is also an interest in the definition of action (and metric distance) conditioned with a 
given probability measure [i € M\ . We introduce these definitions and reformulate parts of 
the main results Theorems Q312] in terms of these. 
For a given and E > E_, let 

H E {li) -=U> E C^M) ; j h(x,d(f>)dfi < #j . (3.4) 

In analogy with (|2.13p we define the \i— conditional metric on A € 

2?b(A||m) :=sup( f 4>d\ ; G H E (fx)\ . (3.5) 
<f> Um J 

The conditioned, modified action with respect to /J, € .M^ is defined in analogy with 
Theorem (HP [3D 

Cr(A||/i) := maxI>K(A||M) - KT = sup / -Th(x,d<t>)d/j. + 4>d\ . (3.6) 



Example 3.2. >ls in Example \3.1\ l(x,v) = \v\ p /(p — 1) implies /i(£) = q where q = 

p/(p - 1). r/ien Q (U) is re/atod to {Tg ), £/jat is W^AH/z) = V E (X\\n) w/iere £ = g" 9 
or 

2? b (A||m) = } (A||/x) , C T (A||/z) = (^(AH/x))" (3.7) 

Remark 3.5. It seems there is a relation between this definition and the tangential gradient 
J3J/. There are also possible applications to optimal network and irrigation theory, where one 
wishes to minimize D(X\\fi) over some constrained set of fj, £ Mf (the irrigation network) 
for a prescribed A (representing the set of sources and targets). See, e.g. fS^, f^j and the ref. 
within. 

The next result is 

Theorem 3. For any A € M.q, 

V E {\) = min V E {\\\n) , C T (A) = min C T (A||^) . 

The analog of Theorem [2] holds for the conditional action as well. However, we can only prove 
the T— convergence in that case. Recall that a sequence of functionals F n : X n ->KU {00} is 
said to T— converge to F : X —* R U {00} (T — linin^oo F n = F) if and only if 

(i) X n C X for any n. 

(ii) For any sequence x n € X n converging to x £ X in the topology of X , 

liminf F n (x n ) > F(x) . 

n—>oo 
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(iii) For any x € X there exists a sequence x n € X n converging to x € X in the topology of 
X for which 

lim F n (x n ) = F(x) . 

n^oo 

In Theorem [J] below the T— convergence is related to the special case where X n = X: 
Theorem 4. Let X„, = Mo x M± = X and F n (X, fi) := nC^/ n (^ + X~/n, fi + A + /n). Then 

Ct(-||-) = T- lim F n . 

n-^oo 

In addition, if [i n is a minimizer of F n inMf then any converging subsequence of \i n , n — > oo, 
converges to a minimizer o/C(A||-) in Mf. 

Since Ct is continuous in the weak topology on Mf we obtain that a minimizer of C(A 
can be approximated by points allocation: 

Corollary 3.1. Let M +,N given by and [i 3 n a minimizer of C T / n (/u + X + /j, n + X" /j) 

in M + ' N . Then any limit of linin—xx, limj^oo (J n is a minimizer ofC(-\\X). 

Finally, we note that (jl.7p is a special case of Theorem [H Using Examples 13. H 13.21 with 
e = 1/n, recalling (q — 1) _1 = p — 1 we obtain 

Corollary 3.2. 

Wi(A) = lim n min W p (fi + X + /n, fi + A~/n) 

4 Proof of Theorems 

We first show that C(X) < oo (recall ([32])). 

Lemma 4.1. For any A G Mo, M c x ^ 0. in particular, since the Lagrangian I is bounded 
from below, C(A) < oo. 

Proof. It is enough to show that there exists a compact set K C TM and a sequence {A n } C 
Mo converging weakly to A such that for each n there exists v n £ M\ whose support is 
contained in K. Indeed, such a set is compact and there exists a weak limit v = lmin^oo v n 
which satisfies limn^oo vu n = vv as well. Hence, if <p G C 1 (M) then 

lim / (d4>,v)du n = / {dcj),v)dv , lim / </>o!A ra = / (/>dA n . 
n ~*°°JM Jm n ^°°JM Jm 



Since f„£M we get 



(d(j),v)dv n = I (f)dX n 
m Jm 



for any n, so the same equality holds for v as well. 
Now, we consider 



\n = a n ^2(6 Xj -5 yj ) (4.1) 
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where Xj,yj G M and a n > 0. For any pair (xj,yj) consider a geodesic arc corresponding to 
the Riemannian metric which connect x to y, parameterized by the arc length: Zj : [0, 1] — »■ M 
and \z\ = D(xj,yj) (recall section [L2l (fT])). Then 



e/ 

j=l J ° 

satisfies for any <j> G C 1 (M) 

/ (d(/),v)dv n = a n y2 (d(f>(zj(s),Zj(s))zj(t))dt = a n y2 -j-<f>{zj(s)) dt 
Jm j=i Jo j^i Jo 

n ,. 

= a n V - 0(^)1 = / 4>d\ n (4.2) 

hence v n G . Finally, we can certainly find such a sequence A n of the form (|4.ip which 
converges weakly to A. □ 

Proof of Theorem ffl jT]^2| 

First we note that it is enough to assume T = 1. Consider 



jrfafy.- / -h(x,d<p)dfi + (/)d\ (4.3) 
A/ 

where A G .Mo is prescribed. Evidently, J 7 is convex lower semi continuous (l.s.c) in fx on 
.M^ and concave upper semi continuous (u.s.c) in <fi on C 1 (M). Since .M^ is compact, the 
Minimax Theorem implies 



sup min <f>) = min sup <ft) ■ (4-4) 

Next define 



fPec 1 (M) tieMf neMJ ^ec 1 (m) 



9(v,<ji):= (l(x,v) - (d<j),v))dv+ / <£dA . 
in/ Vm 



on Mt(TM) x C 1 (M). Then (recall <^J 



sup inf G{v,4>)< inf / /(x, u )di> = C(A) . (4.5) 

</)GC 1 (M) ueM^(TM) v ^ M UtM 

Now 

i f TM l(x,v)dv Hue MX 

Q(y):= sup ^ = J J ™ v y 



We recall, again, from the Minmax Theorem that the inequality in (j4.5h turns into an equal- 
ity provided the set {y G M\ (TM); ^(^) < C(A)} is compact. However C{v) < oo by 
Lemma 14.11 Since I is super linear in v uniformly in x (see section I1.21 I71) it follows that the 
sub-level set \y G M c x ; J TM l(x,v)du < C < oo} is tight for any constant C, hence compact. 
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Next 

(l(x, v) — (d(f), v)) dv(x, v ) + / (f)d\ 

TM Jm 

4>d\ — h(x,d(f>)dfi + / (l(x, v) — (dxf), v) + h(x, d(f>)) dv(x, v). (4.6) 

M JTM 

where \x = n#i/. By the Young inequality l(x,v) + h(x,£) > (£,v)r x \ for any £ G T*M, 
v G T X M with equality if and only if v = h%(x, dcj)(x)). So, the second term on the right of 
(|4,6p is non-negative, but, for any fi G M.\ 



V 



rail I (l(x,v) - (#,«)) dv{x,v) ; v £ M{(TM) ,IL#i> = = - h(x,d<j))dn 



is realized for v = (^-^j (x,d<£(a;)) © £ jM|(TM). From this and (|4.6p we obtain 

inf Q(u,<p)= inf T{4>,n) 

ueM+(TM) fieM + 

hence 

sup inf Q(v,4>) = sup inf T((f>, fi) = C(X) 
4>eG 1 (M)veMf(TM) 4>eC 1 {M)neMf 



and this part of the Theorem follows from 
□ 

We now turn to the proof Theorem [T]-([2H=>CI]). 

For £gK, let o~e ■ TM — ► R the support function of the level surface h(x, £) < £7, that is: 

<te(x,v) : = sup {(£,i>)(a.) ; < -E} • (4.7) 

It follows from our standing assumptions (Section ll,2l l7|) that oe is differentiable as a function 
of E for any (x,v) G TM. For the following Lemma see e.g. [25J: 

Lemma 4.2. . Recall that 

D E (x,y):=M Q C T (x,y) + ET (4.8) 
where Ct as defined in \2. 8\) . Then (recall section [T^W 5\) 

D E (x,y)= inf I a E (z(s), z(s)) ds . (4.9) 

Z&Qs.y JO 

Given x G M, let 

£ := inf {E 1 G R; D^x, z) > -oo} (4.10) 
For the following Lemma see |21| (also |27j): 



Lemma 4.3. E is independent of x G M. T/ie definitions {4-10 ) and \2. 2\) and \2.J$ are 



equivalent. If E > E then D E (x,y) > — oo for any x, y G M and, in addition 
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i) De(x, x) = for any x G M. 

ii) For any x,y,z G M, De(x, z) < De(x, y) + DE{y, z) 

From (|4.8|) . Lemma |4, 21 and the continuity of <7e with respect to E > Ewe get 

Corollary 4.1. If E > E then for any x,y G M, DE(x,y) is continuous, monotone non- 
decreasing and concave as a function of E. 

Note that the differentiability of oe with respect to E does not imply that De(x,jj) is 
differentiable for each x,y G M. However, since DE(x,y) is a concave function of E for 
each x, y G M, it is differentiable for Lebesgue almost any E > E_. We then obtain by 
differentiation 

Lemma 4.4. If E is a point of differentiability of DE(x,y) then there exists a geodesic arc 
z G K-x tV realizing such that the E derivative of DE(x,y) is given by 

d f 1 

T E (x, y) := -^D E (x, y) = J a E (z(s), z(s)) ds , (4.11) 

where o~' E is the E derivative of aE- Moreover 

D E {x,y) = C TE{Xty) (x,y) + ET E {x,y) . (4.12) 

From (|4.7p we get o~e{x,v) < |u|max{|p| ; h(x,£) < E}. From our standing assumption 
on h (section [1.2l -(j7|)) and (|4.9p we obtain 

Lemma 4.5. For any x,y G M and E > E 

D E (x,y) < h- l {E + C)D{ Xl y) 

In particular 

lim E~ x De(x, y) = (4.13) 

E^oo 

uniformly on M x M. 

Corollary 4.2. The set Ce \2.12i) is contained in the set of ' Lipschitz functions with respect 
to D, and Ce is locally compact in C(M). 

Given x,y G M, let E be a point of differentiability of DE{x,y), and z xy : [0, 1] — ► M a 
geodesic arc connecting x,y and realizing (|4.1ip . Then dr x y := a E {^z xy ,z xy ) ds is a non- 
negative measure on [0, 1], and Tg;(x,y) = f dr^ y is compatible with (I4.11|) . Let ^ y be the 
measure on M obtained by pushing t% from [0, 1] to M via z x y : 

E ._ IE \ E F ka+ 

that is, for any G C(M), 

[ ^% y := f <f> (z% y (tj) dT* y , (4.14) 

JM JO 
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Given (f> E C l {M) let 

H(4>) := sup h(x,d(f)) . (4.15) 

We extend the definition of H to the larger class of Lipschitz functions by the following 
Lemma 4.6. If 4> £ C X (M) then 

H{4>) = min{£; <j) E £e} , 

E^> E 

where He as defined in \2.12^i . 

Proof. First we show that if <p E Ce H C 1 (M) then /i(x, d<^>) < i£ for all x € M. Indeed, for 
any x,y E M and any curve z(-) connecting x to y 

<p(y) - <f>(x) = [ d(f>(z(t)) ■ zdt < D E {x, y)< f a E (z(t), z(t))dt 
Jo Jo 

hence dcj)(x) ■ v < cte(x,v) for any v E T X M. Then, by definition, d(j)(x) is contained in any 
supporting half space which contains the set Q X {E) '■= {£. E T*M; h(x,£) < E}. Since this 
set is convex by assumption, it follows that d<fi E Q X (E), so h(x,d(f>) < E for any x E M. 
Hence ~H{4>) < E. 

Next we show the opposite inequality h{x,d<p) > E for all x E M. Recall ()4.12p . Then 
for any e > we can find T £ > and z £ E JC x ^ y so 

D E (x,y) > r l(z £ (t),z £ (t))dt + (E-e)T £ . (4.16) 
Jo 

Next, for a.e t E [0, T e ] 

h (z £ (t),dcf>(z £ (t))) > *&) ■ dcf>(z E (t)) - I (z £ (t), -z £ {t)) . (4.17) 

Integrate KW\ from to T £ and use z £ E K%* y , (14TT61 [4TT7]) and the definition of £ E to 
obtain 

T- 1 r h (z B (t), #(z e (t))) (it > T^ 1 [</>(y) - 0(x)] - T- 1 I (z £ (t), -z £ {t)) dt>E-e . 
Jo Jo 

Hence, the supremum of h(x, d<p) along the orbit of z £ is, at least, E — e. Since e is arbitrary, 
then H((p) > E. □ 

From Lemma [4.61 and Corollary 14.21 we extend the definition of H to the space Lip(M) of 
Lipschitz functions on M. Let now define 

H* T (X):= sup j -TH(<f>) + [ cpd\\ GRU {cxd} . (4.18) 
<j>eLip(M) I Jm J 

Proposition 4.1. For any X E Mo 

H* T (X) = sup {V E {X) - TE) . (4.19) 

E>E 
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Proof. By definition of H and Lemma 



sup 

<j>£Lip(M) 



4>dX - TH( 

M 

sup sup 

E>E<f>£Lip{M) 



sup sup 

b&Lip{M) E>E 



<t>d\-TE\ 4> E C E 



M 



4>dX-TE ; 4> e 



sup {D S (A) - T£} , (4.20) 

□ 



where we used the duality relation given by (|2.13p . 
Corollary 4.3. H T is weakly continuous on M.q. 

Proof. For each E > E, the Monge-Kantorovich metric T>e '■ -Mo — > M. is continuous on Aio 
(under weak* topology). Indeed, it is u.s.c. by (j2.11[) and l.s.c. by the dual formulation 
(I2TT3D . 

Also, for each A G M.~{, T>e{X) is concave and finite in E for E > E. It follows that T> is 
mutually continuous on [E_, oo[x M®. From (|4,13p we also get that T> is coercive on A4o, that 
is lim^^oo E~ 1 T>e{X) = locally uniformly on M.q. These imply that H T is continuous on 
M via (I4TT91) . □ 

We return now to Corollary 14.11 and Lemma I4.4L It follows that for any countable dense 
set A C M there exists a (possibly empty) set iV c]E, oo[ of zero Lebesgue measure such 
that De(x,u) is differentiable in E e]E,oo[—N, for any x,y £ A. Let .M(-A) C M.q be the 
set of all measures in A4q which are supported on a finite subset of A, and such that A({j;}) 
is rational for any x € A. Again, since A4(A) is countable, it follows by Corollary 14.11 that 
T>e(X) is differentiable (as a function of E) for any A G «M(vl) and any E €]E_,oo[—N for 
a (perhaps larger) set of zero Lebesgue measure. It is also evident that A^o is the weak 
closure of A4(A). 

Lemma 4.7. For any A + — A~ = A € A4(A) and E £]E_, oo[—N, there exists an optimal plan 
A^ G "P(A + ,A~) realizing 



'MxM 

for which 



D E {x,y)dA E (x,y) 



mm / D E (x,y)dA(x,y) =V E (X) 



— V E {X)= Y, A E ({x,y})TE(x,y) 



(4.21) 



(4.22) 



x,y£A 



Proof. Let E n \ E. For each n, set A^ be a minimizer of (|4.2ip subjected to E = E n . We 
choose a subsequence so that the limit 



A x E+ ({x,y}) := lim A X En ({x,y}) 



(4.23) 



exists for any x,y € A. Evidently, A^, + £ V(X + , X ) is an optimal plan for (|4.2ip . Next, 
D En (X)-V E (X)> A-E n ({x,y})(D En (x,y)-D E (x,y)) 

x,y&A 
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Divide by E n — E > and let n — > oo, using (|4.23|) and (|4.11|) we get 



^V E (X)> J]) ^ E+ {{^y})T E {x,y) . 

x,y£A 



(4.24) 



We repeat the same argument for a sequence E n /* E 1 for which 



A*_({x,y}) := lim A| n ({a;,y}) 

n^oo 



and get 



^e(A)< A|-({x,y})r £ (x,y) . 



(4.25) 



Again A^_ is an optimal plan as well. If A^_ = A^ + then we are done. Otherwise, define 
A^,_ as a convex combination of A^,_ and A^, + for which the equality (|4.22p holds due to 

Definition 4.1. For any A G M(A) and E e]E,oo[-N let 



Ma 



:= ^ Ai({a;,y})^ v 



where fi^ y are as given in h4-14\ ) and K E is the particular optimal plant given in Lemma \4- 1\ 

Remark 4.1. Note that J M dfif = D' E {\) for any A G Mq{A) and E e]E, oo[— N by 
Lemma\4~77\ where D E (X) = (d/dE)V E (X). 

Definition 4.2. For any A G Mq, T > 0, E(X,T) is the maximizer of \J^.19 ), that is 

D E (x,T)W -TE(X,T) =H* t (\) . 
By Corollary 14.11 (in particular, the concavity of T> E (X) with E) we obtain 
Lemma 4.8. If E{X, T) > E then 



<T<^V E (X,T) 



E=E(X,T) 



E=E(X,T) 

where d + /dE (res. dT/dE) stands for the right (res. left) derivative. If E(X,T) =E then 



dE 



V E {X) 



< T . 



E=E 



We now define, for any X G Mo, a measure fi\ G Mf in the following way: 

Assume, for now, that A G M(A). If E(X,T) £]E_,oo[—N then define = fi A ' 

according to Definition 14.11 Otherwise, fix a sequence E n £]E_, oo[— N such that E n \ 

E(X,T). Similarly, let E n e]E,oo[-N such that E n / E(X,T). 



16 



Then /i^™ and /i^ n are given by Definition 14.11 for any n. Let fj,^ be a weak limit of the 
sequence /uf™, and, similarly, be a weak limit of the sequence fx® n . 
By Lemma 14.81 and Remark 14.11 we get 

dfJ-l <T < / dfi~ . (4.26) 

1 M JM 

If -E(A, T) = E_ then we can still define and it satisfies the left inequality of (|4.26p . 

Definition 4.3. For any A € A^o> tei /U^ defined in the following way: 
i) If X € M {A) then 



• If E(X,T) > E_ then fi\ is a convex combination ofT 1 /i^,T given by \^.26 ) 
such that fi\ € M.~l (that is, f dfi\,= 1). 

• // E(X, T) = E then 



,, \ T- 1 dfi+j m (4.27) 

where [xm i fi a Mather measure. 

ii) For A ^ .Mq(A), Zet X n € .Mo(-A) a sequence converging weakly to A. T/ien {/^a} is 
£/ie set of weak limits of the sequence fi\ n . 



Proof of Theorem (DdSMS]): 

Define 

G(A,a*):= sup {- / h(x,d<f>)dfi+ I <j)d\\ elU{oo} , Qr(X,fi) := Q(X,Tfi) . 

4>&C X (M) V JM JM J 

(4.: 



C T (A)= inf Q r (A,/x)= inf Q(X,Tfj,) . (4.29) 



4>gC 1 (m) 
Recall from [Tf^2] that 



Also, from (14.18J) . (14.15J) and Proposition |4~T1 

i?r(A) < Qt(A,m) Vfi£Mt . (4.30) 

We have to show that 

H* T (X)= inf Qt(A, h) (4.31) 

for any A € .Mo- It is enough to prove (|4.31|) for a dense set of in Mq, say for any A € Mo(A). 
Suppose (|4.3ip holds for a sequence {A n } C A4o(A) converging weakly to A € M.q, that 
is, H T (X n ) = Ct(A„). Since H T is weakly continuous by Corollary 14.31 we get H T {\) = 
lim-n^oo H T (\ n ). On the other hand we recall that, according to definition [2] of Theorem [H 
Ct : Mo i-> R is l.s.c. So lim n ^oo (?r(A n ) > C T (A), hence H* T (X) > C T (X). By (|429ll430l) we 
get (|4.3ip for any A G X . 

The proof of [2] ^O] then follows from 
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Lemma 4.9. For any X G M.q{A) 

Q T (X^ X ) = H* T (X) (4.32) 
holds where [i\ G is as given in Definition \4-3\ 

Proof. Let A G .MoO*) and £ G]E, oo[-iV. Then we use (gUPl for any G C^M) 
/i(x, #)a>f = - V A({x,y}) / j,(s)))ds 

We now perform a change of variables ds ^ dt = a' E (z^ y (s), z xy (s)) ds which transforms 
the interval [0,1] into [0,Tg;(x,y)] (see (|4.1ip ) and we get 

h{x,d4)di4 = - £ M{x,y}) [ TEX ' V h(z*Jt),d<t>(z* y (tf))dt 



where z xy is the re-parametrization of z£ y , satisfying z xy (0) = x, z xy (T E (x,y)) = y. Next 

/ <pdX= [ dAf(x,y)[0(y)-0(x)] = ]T A({x,y}) ^ dcf> (z* y (t)) z* y (t)dt 
J M jm „~r, Jo v 7 



so 



x,y£A 

Im dX ~ hi K x > d<PWf = 

■T E {x,y) r 



x,y£A 



Af({x,y}) f """" [d4^, v {t))^ y {t)-h^ y {t)M{^, y it 

— -A Jo 

< ]T Af({x,y}) r EiX ' y \(z^ y (t)Xy(t))dt= Yl Afax,y})C TE ^ y) (x,y) 



= £ Af({x,y})[C TB(:!;i3/) (x,y)+ J ET £ (x,y)] -E £ Af ({x, y})T £ (x, y) = 

x,y£A x,y£A 

Y Af ({x, y})D B (x, y) - £ £ Af ({x, y})T £ (x, y) = V E {X) - ED E {X) . (4.33) 

x,y£A x,y£A 

To obtain (|4.33p we used the Young inequality in the second line, (|4,12[) and (|4.22[) on the 
last line. 

Since ()4.33p is valid for any (j) G C 1 (M) we get from this and (|4.30p that 

V E {X) - ED E {X) > Q(A, /if) > H* T (X) = maxV E (X) - TE , (4.34) 

E^>E 

holds for any E > E. Now, if it so happens that the maximizer E(X, T) on the right of (|4.34p 
is on the complement of the set N in [E_, oof, then D E (X) = T = f M d[i^ for E = E(X,T) 
via Lemma 14.81 and the inequality in (|4.34p turns into an equality. Otherwise, if E(X,T) G 
N - {E}, we take the sequences E n / E{X,T), E n \ E(X,T) for E n ,E n e]E,oo[-N and 
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the corresponding limits /xt, /i, denned in (|4.26|) . Since Qt is a convex, l.s.c as a function 
of (j, we get that the left inequality in (|4.34|) survives the limit, and 

D E (x,T)W-E(\,T)^D E{ y T) (\) > Q(X,fj,+) , D E{ y T) (X)-E(X,T)^V E{XiT) (X) > Q(X,^[ 

(4.35) 

while j E -'D E (x,T)W = J ^Ma an< ^ %e^e(x,t){^) = / ^Ma- Then, upon taking a convex 
combination /*A = aT _1 /i| + T _1 (l — a)/i7 such that, according to Definition 14.31 

a—D E(xX) (X) + (1 - a)— £» £(Aj t)(A) = T J dfi x = T (4.36) 
and using the convexity of Q in we get from (|4.35l 14. 36[) 

^(A,T)( A ) ~TE(X,T) > Q(X, Tfi\) = Qt(A,ma) 

This, with the right inequality of (|4.32p yields the equality Qt(A,Ma) = H T (X). 

Finally, if E(X, T) = E_ we proceed as follows: Let E n s \ E_ and Ma := Yunn^^ /if . It 
follows that 

/ d/x+ = lim / d M f = lim D' En {X) = ^D E {X) G (0,T] . (4.37) 

Let /t A as in (14T271) . From (IQgl , OTj) and CLU we get 

Qt(A, ma) < S(A, 4) + (t- ^D E (X^j Q(0, m ) = Q(X, M +) - (t - ^D E (X)j E 

(4.38) 

while ([24]) and the left part of P~35|) for £ = E imply 

Q(A, p+) < D E (X) - E^D E (X) . (4.39) 

From (|4T38j) and (14391) we get 

Qt(A, ma) < D E (X) — ET < H* T (X) 

and the equality holds via (|4.30|) . The last part of Theorem [1] follows from the equality in 
(|O0j) as well. □ 

Proof of Theorem [3j 
Theorem [TJ- ([2]) and (13. 6p imply 

C T (A) = min C T (A||/t) . (4.40) 

Next, we note that 2?_£;(A||/i) is a concave function of -E for E > E. In fact, from (|3.4p and 
convexity of /i(x, •) for each x £ M we obtain 

0i G , i = 1, 2 => at^i + (1 - a)^ 2 £ W Q , £ ; 1+ (i_ Q ) £ ; 2 
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for a € (0,1) and E\,E2 > E. The concavity of "D(.)(A||/x) follows from its definition 
Then, by convex duality and (|3.6[) 



By the same argument 



V E (X\\n) = mm Ct (A ||//) + ET 



V E (X) = min C T (X) + ET 

T>0 L 



Hence, (|4.40p and Theorem [T]- dSJ) imply 



min T>E(X\\fi) = min min Cr(A||/i) + ET 



mm mm 

T>o 



C T (X\\fi)+ET 



mm 

T>o L 



C T (A) +ET = V E (X) . 



□ 



5 Proof of Theorems 



We start by the following auxiliary results: 



Lemma 15 . 1 1 follows from the surjectivity of Expf\x) as a mapping from T X M to M, for any 
x G M and any t ^ (Recall definition at Section I1.2| |8"|) : 

Lemma 5.1. Let A G 7W + (M x M). For any t > t/iere exists a 5ore/ measure 6 



.M + (TM) suc/i t/iaf [l®Exp 



"(f)) ^ W = A - ^ere J®EspjJj(a;,u) := (aj,J5xp[|j(2?,u)J. 
The proof of Lemma 15.21 follows directly from the definition of the optimal plan: 

Lemma 5.2. Lei A k a minimizer for $2.6)) . B C M x M a Borel subset and A[b t/te 
restriction of A to B. Let fi B , n B the marginals of A[b on the factors of M x M. Then A|__b 
is an optimal plan for C (/i^,/i^). In addition, if B±,B2 C M x M are disjoint Borel sets 
then 

^ A[biUB 2 * s ^ e optimal plan with respect to C (n Bl + Ms 2 ) + Mfio) • 
Lemma 15.31 represents the time interpolation of optimal plans (see [28J): 



an 



Lemma 5.3. Given t > and A = A 
reafcing 

C t (A+,A-) 



A £ .Mo- Let A* € 'P(A + ,A ) 6e an optimal plan 
C t (x,y)A t (dxdy) . 

Let A* G M + (TM) gwen m Lemma\5J\ for A = A*. Let X s := ^Exp^ A*. T/ien, i/ 
< s < t, 

C S (X + , X a ) + C t - S (A S , A") = C t (X + , A") . 
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Lemma 5.4. For any A + , A G M J~ satisfying A = A + 

C T (A+,A-) > C T (A) 

For the next Lemma we need: 

H) There exists a sequence of smooth, positive mollifiers 5 e : M x M — ► M + such that, for 
any G C°(M) (res. c/> G C^M)) 

lim 5 e * 4> = 4> 

where the convergence is in C°(M) (res. C l {M)) and for any e > and G C 1 (M) 

o~ e * d0 = d(<5 £ * (f>) . 

In addition, for any (x,p) G T*M and e > there exists 5 > such that h(x,£) — h(y,^ y ) < 
e(h(x,(,) + 1) provided D(x,y) < 5. Here £ y is obtained by parallel translation of (x,^) to y. 

Lemma 5.5. Ct(X\\p) is l.s.c in the weak-* topology of M.q x hA\ . Assuming H, for any 
A G M.q, p G M-i there exists a sequence {fin} = {Pn{x)dx} C Aif, {X n } = {p n (?n ~~ 
q~)dx} C .Mo where p n G C°°(M) are positive everywhere, q^ G C°°(M) non-negatives such 
that X n — * X, fL n — 1 /i and 

lim C T (A„ ||/2 n ) =C t (A||m) • (5.1) 
Lemma 5.6. For any [i G -M*, A = A + — A - G .Mo 

liminfe~ 1 C £ TG« + £A",/i + £A + ) > C T {X\\p) . 

Lemma 5.7. Assume p = p(x)dx and X = p(q + — q~)dx where p,q^ are C°° functions, p 
positive everywhere on M . Then 

limsupe~ 1 C e T' (u + eA~, p + eX + ) <Ct(X\\p) . 

Lemma 5.8. For T > 0, 

Ct(X) > limsupe -1 inf C £ t(p + eA + , p + eA~) . 

The proofs of lemma I5.4H5.8I are given at the end of this section. 

Proof of theorem [2j 

From Theorem [T]- (1) we get 

C eT (eX) = eC T (X) . 

We now apply Lemma I5"~4"| adapted to the case where |A ± | := f X^ ^ 1. Then 
C T (A + ,A-) = |A ± |C T (^,^ >|A±|C T (^) =C T/ , A± |(A) . 



- A- G M 
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Note that J d/j, + edX^ 1 = 1 + 0(e), hence 

e- x C eT (^ + eX+^ + eX~) >C Te (X) 
where T £ — > T as e — > 0. Hence 

liminf inf £ _1 C E r (/x + eA + ,/x + eA~) > C T (X) . 

e^O M + 

The Theorem follows from this and Lemma 15.81 
□ 

Proof of Theorem |4) 

We have to show that for any (/u, A) £ M.f x M.q and any sequence (/u n ,A n ) — (/i, A) as 
n — > oo: 

liminf nC T/n (/z n + n -1 A+,/i n + n _1 A~) >C(A||/i) (5.2) 

and, in addition, there exists a sequence (fi n , X n ) — ^ (/i, A) for which 

lim nC T/n (ji n + n -1 A+,/i n + n _1 A~ ) = C(X\\fi) . (5.3) 

The inequality (|5.2I) follows directly from Lemma 15.61 To prove (I5.3|) . we first consider the 
sequence (ft n ,X n ) subjected to Lemma [531 From Lemma [5771 and Lemma [531 



lim limsup nCWj (A? + n ^tiP-i + n AA ) < lim Ct (Ai||/2 3 -) = C(X\\fi) 
So, there exists a subsequence j n along which 

limsup nC T/n (p, jn + n _1 A+ ,/2 jn + n^A^ ) < C(A||/i) . 

n — >na \ * 



This, with ([572]), implies (|5T3j) . 

The second part of the theorem follows from (|5.2p and Theorem [5J 
□ 

Proof, of Lemma \5.4\ We use the duality representation of the Monge-Kantorovich functional 
[26] to obtain (recall X ± e M^) 



C T (X-,X + ) + ET = sup j / 



V>dA~ - 0dA+ , - xj){x) < C T (x, y) + ET 



By ([27101) Ct(», y) + ET > D E (x, y) for any x,y £ M so, by (127121 127T3D 



sup <^ / xl)d\- - (f>dX + , (j>{y) - V(x) < C T (x, y) + ET}> sup \ / 0dA , <f>(y) - </>(x) < D E (x, y) 
i/>,4> Um J <f> I J M 

= V E {X) (5.4) 
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so 

C T {\~,\ + ) > V E {\)-ET 
for any E > E. By Theorem [T}© 

C T (\-, A+) > sup V E {\) — ET = C T (\) . 

E>E 



Proof, of Lemma \5.5l From (|3.5l 13. 6p we obtain 



□ 



Ct(A||/x) = sup / <j)d\ — Th(x,dcf))dfj, . 

(j>&C 1 (M)JM 

In particular Ct is l.s.c (and convex) on A4q x j\4^ . 

Let e n — > and A n := A £n := S £n * A G A4o denned by 

^dA n := A(5 £n * V) W> € C°(M) . (5.5) 

M 

By H, A„ — 1 A while A n are have smooth density. First, we observe that lim^^oo A n — A. 
Indeed, for any ip G C l {M): 



n—*oo 



lim / tpdX n = lim A (S Sn * ip) = ■ 



Next, by Jensen's Theorem and H again 



h(x,dS £ * (p)dfi = / h(x,5 £ * d(f))dn < / h(x,d(j)(y))5 £ (x,y)dfj,(x)dy 

M JM JMxM 

= I h(x,d<t>)dS £ * n+ [h(x,d(p(y)) - h(y,d(j)(y)]5 e (x,y)dfi(x)dy (5.6) 

JM JMxM 

From section [L~2l -(i7l) and using 5 £ (x,y) = o(l) for D(x,y) > 5, 



[h(x,d$(y)) -h(y,d(f>(y)]8 e (x,y)dy(x)dy <0(e)+o(1) / h(x,d<p)d5 E * n . 

M x M JM 

Next, define fi n = 5 £n * \x. Let ip n be the maximizer of C(X n \\/i n ), that is 

Cr(A n ||/i n ) = / ip n d\ n -Th(x,dip n )dn n 

JM 



By (|53ll53 



Cr(A„||/i„) < / 5 e * ?MA - (1 - o(l)) / T/i(z,d$ e *Vn)d/i + 0(e n ) 
'm Jm 



(l-o(l)) 



5e*i>n- ttt - / Th(x,dd £ *ip n )diJ, 

M 1 - o(l) Jm 
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(5.7) 



We obtained 

limsupCT(A n ||// n ) < C T (X\\n) 



n^oo 



which, together with the l.s.c of Ct, implies the result. □ 
Proof, of Lemma \5.b\ Recall that the Lax-Oleinik Semigroup acting on <p G C°(M) 

ip(x,t) = LO(4>)n x ) := sup [4>{y) - C t (x,y)} 

is a viscosity solution of the Hamilton-Jacobi equation dtip — h(x, dip) = subjected to 
■tpo = <j){x). If <f> G C 1 (M) then ip is a classical solution on some neighborhood of i = 0, so 

lira LO(<j))(T,-) = > ^ m T~ [LO((f))(T,x) ~ 4>i x )] = h(x,d(f)) . 
Then for any pi,p2 € A^f 

Cr(^i,/"2)= sup <^ / 4>diM 2 - i)dnx ; <j){x) -ip{y) <C T {x,y) Vx,yGM 



sup / ^d)U2 - LO(<j))f T)X \dfii (5.8) 

Hence 

liminf £~ 1 C £ t{p + £X~,fi + eX + ) = 

£—♦0 

liminf sup / e' 1 [<f>(x) - LO{4>), £Tx) ] dfi + / <pdX + - LO{<j))u Tx) dX~ 

> sup lira / e^ 1 [^(x) - LO(0) (e2> )] d/i + / 4>dX + - LO{(j))f £T}X) dX~ 
<t>£C 1 (M) £ ^°JM Jm 

= sup / — Th(x, d(j))diJL + <f>dX := Ct(X\\/j,) . (5.9) 

□ 

Proof, of Lemma \5. 7| We may describe the optimal mapping : M — > M associated with 
C £ t(p + eA~,/x + eA + ) in local coordinates on each chart. It is given by the solution to the 
Monge- Ampere equation 

dpt\7 o_ p(x)(l+eq-{x)) 

where 

V^ = -V x C £T (x,S £T (x)) (5.11) 

and 

C eT (p + eX-,p + eX + ) = [ C £T (x,S £T ( x ))p(l+eTq-)dx (5.12) 

JM 
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We recall that the inverse of S/ x C e t(x, •) with respect to the second variable is Id + eTVip, 
to leading order in e. That is, 

V x C eT (x, x + eTd p h(x, + (eT) 2 Q(x, £, e)) = -£ (5.13) 

where (here and below) Q is a generic smooth function of its arguments. 
Hence, S £ t can be expanded in e in terms of ip as 

S eT (x) = x + eTh^x, Vip) + (eT) 2 Q(x, V^, e) (5.14) 

We now expand the right side of (|5.10p using (|5.14p to obtain 

1 + eT [q~{x] - q + (x) - ht(x,dif>) • V x lnp(x)] + (eT) 2 Q(x, Vip, x, e) (5.15) 

while the left hand side is 

det(V x S eT ) = 1 + eTV ■ h^x, dip) + (eT) 2 Q(x, Vip, Wip, x, e) (5.16) 

Comparing (|5.15l I5.16|) , divide by eT and multiply by p to obtain 

TV • (phe(x, dtp)) = p{q~ - q+) + eT P Q(x, Vip, VVip, x, e) . (5.17) 

Now, we substitute e = and get a quasi- linear equation for ipQ : 

TV • (p/^Oc, # )) = - q+) . (5.18) 

■00 is a maximizer of 



C T (A||/i) = / p{q + -q )ipa~ / pTh{x,dips)dx 
Jm Jm 

By elliptic regularity, ipo G C°°(M). Multiply (|5.18p by an d integrate over M to obtain 

p{q + -q~)=j pTh(.{x,dipQ) -VipQ 

M JM 
Then by the Lagrangian/Hamiltonian duality 

C T (\\\n)= f pT[ViPo-h(x,dip ) -h(x,d^ )} =T [ pi (x, h € (x, #o)) • (5.19) 
Jm Jm 

We observe I (x, ^) > T^Cr^y). So, (I5TT2D with (IfTTiD imply 

(er)- 1 C £T (// + eA-,^ + eA + ) < / p(l + sTq~)l (x, h^(x, Vip £ + eTQ(x, Vip e ,e)) (5.20) 

Jm 

where ip £ is a solution of (|5.17p . Now, if we show that lim e ^oV'e = V'o hi C 1 (M) then, from 
(I5TT91 15301) 

limsup(e) _1 C eT (p + e\~ , p + e\ + ) <T pi (x, h^(x, dip )) = C(X\\p) . 
e^o ' Jm 
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Next we show that, indeed, lim^oV'e = V'o hi C (M). 
Substitute ip B = ipo + 4> £ hi f)5. 1T|) . We obtain 

V • (a(x)V<f> e ) = eQ(x, V0 £ , VV0 £ , e) + V • (p(VVe, Q(x, V</>, e) • V0 e >) (5.21) 

where cr := Th^(x,'V^o(x)) is a positive definite form, while Q is a smooth matrix valued 
functions in both x and e, determined by V^o and Q as given in (|5.17p . A direct application 
of the implicit function theorem implies the existence of a branch (\(e),r] £ ) of solutions for 

V • (a(x)Vr]) = eQ(x, Vr/, VVr/, e) + V • (p(V'r/, Q(x, V77, e) o V??}) + A(e) (5.22) 



where r/o = A(0) = and e 1— > r/ £ is (at least) continuous in C (M) _L 1. Note that for e / 
we may have a non-zero A(e) which follows from projecting the right side on the equation to 
the Hilbert space perpendicular to constants (recall that M is a compact manifold without 
boundary, and the left side is surjective on this space). We now show that rj £ = (j) e , i.e 
A(e) = also for e^O. Indeed, (|5.21|) is equivalent to (|5.1U|) multiplied by p(x)/e, so (|5.22|) 
is equivalent to 

detV x S eT = - y + ep (x)X(e) 

p(S £T (x))(l+eq+(S £T (x)) 

where S £ t(x) obtained from (|5.14p with ip £ := ipo + r\ £ . 
Hence 

/ (p(S eT (x))(l + eq + (S £T (x))) det(V x S £T ) = [ (p(x)(l + eq-(x))) 

JM v 7 JM 

+ £ X(e)[ p{S f[ x) \ l + e q + (S £T (x)) (5.23) 
Jm p{x) 

However, S £ t{x) = x + 0(e) is a diffeomorphism on M, so 

/ (p(S £T (x))(l+eq + (S £T (x))) det(V x S eT ) = [ (p(S £T (x))(l + Tq + (S £T (x))) \det(V x S eT )\ 

J M J JM v 7 



p(x)(l+eq + (x)) = p(x)(l+eq-(x)) . (5.24) 

M JM 



It follows that 



eA(e) f pA^)l {1 + £q+{ s Mx)) = o. 

Jm P{x) 



Since p is positive everywhere it follows that A(e) = for |e| sufficiently small. We proved 
that r] £ = <f) £ and, in particular, cf> £ — > as e — > in C 1 _L 1, which implies the convergence of 
V> £ to at e -> in C 1 _L 1. □ 

Proof, (of Lemma 15 .8|) Given e > let 

L>|(x, y) := inf [C £nT (z, y) + enET] . (5.25) 
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Evidently, D £ E (x,y) is continuous on M x M locally uniformly in E > E. Moreover, 

lim D% = D E (5.26) 

e\0 

uniformly on M x M and locally uniformly in E > E as well. 

We now decompose J\d x Ad into mutually disjoint Borel sets Qn- 

MxM = U n Q £ n , Q e n n Q £ En , = iin^n 

such that 

Qn C {(x, y) G M x M ; D^i, y) = C £nT (x, y) + en£T} . 
Let € "P(A + , A - ) be an optimal plan for 

V%{\)= I D E (x,y)dA? = min / D%(x,y)dA , (5.27) 

and A" = A^" |_Q e > the restriction of A^ to Q e n . Set A^ to be the marginals of A™ on the first 
and second factors of M x M. Then Y^=i^e = ^-e an d 

oo 
n=l 

Remark 5.1. Abie f/mi Q e n = /or a// but a finite number of n G N. In particular, the sum 
\5. 28\) contains only a finite number of non-zero terms. 

Let |A n | := J M dX^ = J MxM dA^. The averaged flight time is 

oo 

{Tf :=eT^n|A n | (5.29) 

n=l 

We observe that (T) £ E 8e'D e (X), where Be is the super gradient as a function of E\ At this 
stage we choose E depending on e, T such that 

(T) e = T + 2eT|A ± | (5.30) 

We now apply Lemma [5TT1 Recalling Section [L2] |8l let A™ € M + (TM) satisfying 
(l®Exp { ^ EnT) \ A n £ = A™. Use A" to define Xi := ^Exp^ £nT) ^j A™ G A4+(M) for 
jf = 0, 1 ... n. Note that 

\i = > = \i • (5.31) 



By Lemma [ 

n-1 



C £TlT (A+ A~) + £ n£T|A n | = [C eT (A J n ,A{ +1 ) +^T|A n |] (5.32) 

j=0 
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Prom ([5351 15371 E2S1 and Lemma [ 

co oo oo n— 1 

P|(A) = X>I(V) = E [CWt(A+,AJ + en£T|A„|] = £ £ (C £T (A£, A£ +1 ) + e£T|A n |) . 

n=l n=l n=l j=0 



(5.33) 



Let now 

co n— 1 



n=l j=l 



Note that 



n=l j'=0 n=l n=l 

By we obtain 



oo 

,e,E 



eJ2(n + l)\\±\ -2e|A ± | = 1 fj, £ ' E € M\ . (5.34) 



n=l 



By (^28ll53T|) 

co n— 1 / co n— 1 co n \ / co n— 1 oo n 

X; E ^(A£,, a^ +i ) >c £ r EE A -EE A ^ +1 = e ^ e EE A ^> e E E A ^ +1 

n=l j'=0 \ n=l j=0 n=l j=l J \ n=l j=0 n=l j=l 

= £~ 1 C eT {ijl £ ' E + e\ + ,fi £ > E + e\~) . (5.35) 

From (15391 15331 15351 , 15341) 

£>|(A) - (T) £ E > e~ l C £T {ff' E + e\ + , n e > E + eA~) > e" 1 inf C eT (n + eA + , /i + eX~) . 

(5.36) 

Finally, Theorem Q}3, (153611530115361) imply 

Cr(A) > T>e(X) —TE = Hm2?|;(A) — (T) e E > limsupe -1 inf C eT Ui + eA + , /i + eA~) . 

□ 

References 

[1] Luigi Ambrosio, L and Pratelli, L: Existence and stability results in the LI theory of 
optimal transportation, Lect. Notes in Math, 1813, (2003) 

[2] V. Bangert: Minimal measures and minimizing closed normal one-currents, GAFA, 9, 
413-427. (1999) 

[3] Bernard, P: Young measures, superposition and transport, Indiana Univ. Math. J. 57 $ 
1, 247-275, (2008) 



28 



[4] Bernard P. and Buffoni B: Optimal mass transportation and Mather theory , Journal of 
the European Mathematical Society, to appear. 

[5] Bernard P. and Buffoni B: Weak K AM pairs and Monge-Kantorovich duality, Adv. Study- 
in Pure Mathematics, 47, 397-420, (2007) 

[6] Bouchitt, G., Buttazzo, G. and Seppecher, P.: Energies with respect to a measure and 
applications to low dimensional structures. Calc. Var. 5, (1997), 37 - 54 

[7] Bouchitt, G., Buttazzo, G. and Seppecher, P.: Shape optimization solutions via Mon- 
geKantorovich equation, CRAS 324 (1997), 1185- 1191. 

[8] Buttazzo ,G. and Stepanov, E.: Optimal transportation networks as free Dirichlet regions 
for the Monge-Kantorovich problem. Ann. Scuola Norm. Sup. Pisa CI. Sci., (5) 2 (2003), 
631-678. 

[9] Bernot, M, Caselles, V and Morel, J.M.: Optimal Transportation Networks, Lect. Notes 
in Math., 1955, Springer- Verlag (2009) 

[10] G. Contreras, J. Delgado and R. Iturriaga: Lagrangian flows: the dynamics of globally 
minimizing orbits. II, Bol. Soc. Brasil. Mat. (N.S.) 28 , # 2, 155-196, (1997) 

[11] Evans, L.C: A survey of partial differential equations methods in weak KAM theory, 
Comm. Pure Appl. Math. 57 4, 445-480, (2004) 

[12] Evans L.C. and Gangbo W: Differential equations methods for the Monge-Kantorovich 
mass transfer problem, Mem. Amer. Math. Soc. 137, (1999) 

[13] Evans L.C and D. Gomes, D: Effective Hamiltonians and averaging for Hamiltonian 
dynamics. I., Arch. Ration. Mech. Anal. 157 1, 1-33, (2001). 

[14] Evans L.C and D. Gomes, D: Effective Hamiltonians and averaging for Hamiltonian 
dynamics. II. Arch. Ration. Mech. Anal. 161 , 4, 271-305, (2002) 

[15] Fathi A: Weak KAM Theorem in Lagrangian Dynamics, Cambridge Studies in Advanced 
Mathematics (in press) 

[16] Fathi A: Solutions KAM faibles conjuguees et barrieres de Peierls, C. R. Acad. Sci. Paris 
Ser. I Math. 325 , 649652 (1997) 

[17] Fathi A. and Sicololfi A: PDE aspects of Aubry-Mather theory for quasiconvex hamilto- 
nians, Cal. Var. 22, 185-228 (2005) 

[18] Fathi, A and Siconolfi, A: Existence of C l critical subsolutions of the Hamilton- J acobi 
equation. Invent. Math. 155 , # 2, 363-388, (2004) 

[19] Iri, M: Theory of flows in continua as approximation to flows in networks, in Survey of 
Math Programming (ed. by Prekopa), North-Holland, 1979. 

[20] Mather J.N: Action minimizing invariant measures for positive definite Lagrangian sys- 
tems, Math. Z. 207 , 169-207 (1991) 



29 



[21] Mane R: On the minimizing measures of Lagrangian dynamical systems, Nonlinearity 5, 
623-638, (1992) 

[22] Mane R: Lagrangian flows: The dynamics of globally minimizing orbits, Bol. Soc. Bras. 
Mat, 28, 141-153, (1997) 

[23] Strang, G: Maximal flow through a domain, Math. Programming 26, 123-143 (1983) 

[24] Strang, G: L\ and approximations of vector fields in the plane, in Lecture Notes in 
Num. Appl. Analysis 5 273- 288 (1982) 

[25] Rockafeller, R.T: Convex Analysis, Princeton, N.J, Princeton U. Press 

[26] Villani C: Optimal Transport: Old and New, Grundlehren der mathematischen Wis- 
senschaften, (2008) 

[27] Siburg, F: The Principle of Least Action in geometry and Dynamics , Lecture Notes in 
Mathematics 1844, Springer, 2004. 

[28] Villani C: Optimal Transport: Old and New, Grundlehren der mathematischen Wis- 
senschaften, (2008) 

[29] Wolansky, G: Minimizers of Dirichlet functionals on the n-torus and the weak KAM 
theory, Ann. de l'Inst. H. Poincare (C) Non Linear Anal., in press 

[30] Wolansky G: Extended least action principle for steady flows under a prescribed flux, 
Calc. Var. Partial Differential Equations 31, 277-296, (2008) 



30 



